Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
También | 1889 | 81 | 1 | 81.0000 |
Rica | 11242 | 210 | 3 | 70.0000 |
Según | 1700 | 66 | 1 | 66.0000 |
Su | 1552 | 119 | 4 | 29.7500 |
Sus | 376 | 29 | 1 | 29.0000 |
Otro | 415 | 25 | 1 | 25.0000 |
Estos | 734 | 50 | 2 | 25.0000 |
Esa | 470 | 25 | 1 | 25.0000 |
Esto | 1218 | 73 | 3 | 24.3333 |
Durante | 717 | 23 | 1 | 23.0000 |
close | 1514 | 21 | 1 | 21.0000 |
folios | 1043 | 93 | 5 | 18.6000 |
Ante | 358 | 18 | 1 | 18.0000 |
cuales | 1387 | 52 | 3 | 17.3333 |
Sábado | 359 | 33 | 2 | 16.5000 |
Aunque | 772 | 32 | 2 | 16.0000 |
Los | 8495 | 606 | 38 | 15.9474 |
Viernes | 293 | 30 | 2 | 15.0000 |
al. | 299 | 30 | 2 | 15.0000 |
El | 26712 | 1338 | 92 | 14.5435 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
Réplica | 1077 | 1 | 25 | 0.0400 |
dentro | 2095 | 3 | 74 | 0.0405 |
llena | 227 | 1 | 16 | 0.0625 |
Amueblado | 147 | 1 | 15 | 0.0667 |
de. | 430 | 1 | 13 | 0.0769 |
capaz | 295 | 1 | 13 | 0.0769 |
contará | 207 | 1 | 12 | 0.0833 |
relacionadas | 1398 | 2 | 24 | 0.0833 |
miles | 300 | 1 | 11 | 0.0909 |
Hits | 159 | 1 | 11 | 0.0909 |
Fincas | 208 | 2 | 21 | 0.0952 |
compuesto | 109 | 1 | 10 | 0.1000 |
convertirse | 192 | 1 | 10 | 0.1000 |
Date | 3313 | 1 | 10 | 0.1000 |
CRC | 296 | 2 | 19 | 0.1053 |
índice | 159 | 1 | 9 | 0.1111 |
Pulse | 133 | 1 | 9 | 0.1111 |
Ing | 152 | 1 | 9 | 0.1111 |
alrededor | 665 | 2 | 18 | 0.1111 |
concepto | 594 | 2 | 18 | 0.1111 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II